智能论文笔记

AMMASurv: Asymmetrical Multi-Modal Attention for Accurate Survival Analysis with Whole Slide Images and Gene Expression Data

Ruoqi Wang , Ziwang Huang , Haitao Wang , Hejun Wu

分类：计算机视觉 | 人工智能

2021-08-28

使用多模态数据，例如整个幻灯片图像（WSIS）和基因表达数据的生存分析可以导致更准确的生存预测。以前的多模态生存模型无法有效地挖掘每个模态内的内在信息。此外，以前的方法将来自不同模式的信息视为类似的重要性，因此它们不能灵活地利用模态之间的潜在连接。为了解决上述问题，我们提出了一种新的不对称多模态方法，称为AMMASURV。与以前的作品不同，AMMASURV可以有效地利用每个模式内的内在信息，并灵活地适应不同重要性的模式。令人鼓舞的实验结果表明了我们对其他最先进的方法的方法的优越性。

translated by 谷歌翻译

Private Convex Optimization in General Norms

Sivakanth Gopi , Yin Tat Lee , Daogao Liu , Ruoqi Shen , Kevin Tian

分类：机器学习 | (统计)机器学习

2022-07-18

我们提出了一个新的框架，用于对凸函数的差异私有优化，这些功能是任意规范$ \ normx {\ cdot} $中的Lipschitz。我们的算法基于一种正规的指数机制，该机制从密度$ \ propto \ exp（-k（f+\ mu r））$中进行样品，其中$ f $是经验损失，$ r $是一种常规化器，它与强烈的convex convex converize尊重$ \ normx {\ cdot} $，将\ cite {gll22}的最新作品推广到非Euclidean设置。我们表明，这种机制可以满足高斯差异隐私，并通过使用凸几何形状的本地化工具来解决DP-MER（经验风险最小化）和DP-SCO（随机凸优化）。我们的框架是第一个在一般规范空间中适用于私有凸优化的框架，并直接恢复了镜下下降的非私有SCO率，作为隐私参数$ \ eps \ to \ infty $。作为应用程序，对于LipsChitz优化了$ \ ell_p $ norms for（1，2）$中的所有$ p \ norms，我们获得了第一个最佳隐私性权衡权衡；对于$ p = 1 $，我们提高了最近的作品\ cite {asifkt21，bassilygn21}获得的权衡，至少通过对数因素。我们的$ \ ell_p $ norm和schatten- $ p $规范优化框架与多项式时间采样器相辅相成，我们的查询复杂性明确绑定。

translated by 谷歌翻译

Explicit Boundary Guided Semi-Push-Pull Contrastive Learning for Better Anomaly Detection

Xincheng Yao , Chongyang Zhang , Ruoqi Li

分类：计算机视觉

2022-07-04

大多数异常检测算法主要集中于建模正常样品的分布并将异常视为异常值。但是，由于缺乏对异常的知识，该模型的判别性能可能不足。因此，应尽可能利用异常。但是，在训练过程中利用一些已知的异常情况可能会导致另一个问题，即模型可能会受到已知异常的偏见，并且未能概括地看不见异常。在本文中，我们旨在利用一些现有的异常情况，具有精心设计的明确指导的半孔学习策略，这可以增强可区分性，同时减轻由于已知异常不足引起的偏见问题。我们的模型基于两个核心设计：首先，找到一个明确的分离边界作为进一步的对比度学习的指导。具体而言，我们采用归一化流程来学习正常特征分布，然后找到一个明确的分离边界，靠近分布边缘。所获得的显式和紧凑的分离边界仅依赖于正常特征分布，因此可以减轻少数已知异常引起的偏置问题。其次，在显式分离边界的指导下学习更多的判别特征。开发了边界引导的半孔损耗，以将正常特征融合在一起，同时将异常特征推开以外的分离边界以外的边界区域。通过这种方式，我们的模型可以形成更明确，更歧视性的决策边界，以为已知和看不见的异常取得更好的结果，同时还保持高训练效率。对广泛使用的MVTECAD基准进行的广泛实验表明，该方法可实现新的最新结果，其性能为98.8％的图像级AUROC和99.4％的像素级AUROC。

translated by 谷歌翻译

Data Augmentation as Feature Manipulation

Ruoqi Shen , Sébastien Bubeck , Suriya Gunasekar

分类：机器学习 | (统计)机器学习

2022-03-03

数据增强是机器学习管道的基石，但其理论基础尚不清楚。它只是人为增加数据集大小的一种方法吗？还是鼓励模型满足某些不变性？在这项工作中，我们考虑了另一个角度，我们研究了数据增强对学习过程动态的影响。我们发现，数据增强可以改变各种功能的相对重要性，从而有效地使某些信息性但难以学习的功能更有可能在学习过程中捕获。重要的是，我们表明，对于非线性模型，例如神经网络，这种效果更为明显。我们的主要贡献是对Allen-Zhu和Li [2020]最近提出的多视图数据模型中两层卷积神经网络的学习动态数据的详细分析。我们通过进一步的实验证据来补充这一分析，证明数据增加可以看作是特征操纵。

translated by 谷歌翻译

Analysis of Langevin Monte Carlo from Poincaré to Log-Sobolev

Sinho Chewi , Murat A. Erdogdu , Mufan Bill Li , Ruoqi Shen , Matthew Zhang

分类： (统计)机器学习

2021-12-23

经典地，连续时间兰富文队扩散在唯一的假设下迅速迅速迅速迅速迅速，以至于$ \ PI $满足POINCAR的不平等。使用这一事实来为离散时间Langevin Monte Carlo（LMC）算法提供保证，因此由于需要与Chi Squared或R \'enyi分歧的需要，并且在很大程度上主要重点关注日志凹形目标。在这项工作中，我们为LMC提供了第一个收敛保证，假设$ \ PI $满足Lata {\ l} a - oleszkiewicz或修改的log-sobolev不等式，它在Poincar \ e和log-sobolev设置之间插值。与现有作品不同，我们的结果允许弱滑性，并且不需要凸起或耗散条件。

translated by 谷歌翻译

Near-Optimal Randomized Exploration for Tabular MDP

Zhihan Xiong , Ruoqi Shen , Qiwen Cui , Simon S. Du

分类：机器学习

2021-02-19

我们在加固学习中使用汤普森采样（TS） - 样算法中的随机价值函数研究探索。这种类型的算法享有有吸引力的经验性能。我们展示当我们使用1）每一集中的单个随机种子，而2）伯尼斯坦型噪声幅度，我们获得了最坏的情况$ \ widetilde {o}左（h \ sqrt {sat} \右）$遗憾绑定了焦点时间 - 不均匀的马尔可夫决策过程，其中$ S $是国家空间的大小，$ a $的是行动空间的大小，$ h $是规划地平线，$ t $是互动的数量。这种绑定的多项式基于随机值函数的TS样算法的所有现有界限，并且首次匹配$ \ Omega \左（H \ SQRT {SAT}右）$下限到对数因子。我们的结果强调随机勘探可以近乎最佳，这是以前仅通过乐观算法实现的。为了实现所需的结果，我们开发1）新的剪辑操作，以确保持续持续的概率和悲观的概率是较低的常数，并且2）用于分析估计误差的绝对值的新递归公式。后悔。

translated by 谷歌翻译

Cross Modal Transformer via Coordinates Encoding for 3D Object Dectection

Junjie Yan , Yingfei Liu , Jianjian Sun , Fan Jia , Shuailin Li , Tiancai Wang , Xiangyu Zhang

分类：计算机视觉

2023-01-03

In this paper, we propose a robust 3D detector, named Cross Modal Transformer (CMT), for end-to-end 3D multi-modal detection. Without explicit view transformation, CMT takes the image and point clouds tokens as inputs and directly outputs accurate 3D bounding boxes. The spatial alignment of multi-modal tokens is performed implicitly, by encoding the 3D points into multi-modal features. The core design of CMT is quite simple while its performance is impressive. CMT obtains 73.0% NDS on nuScenes benchmark. Moreover, CMT has a strong robustness even if the LiDAR is missing. Code will be released at https://github.com/junjie18/CMT.

translated by 谷歌翻译

A Survey On Few-shot Knowledge Graph Completion with Structural and Commonsense Knowledge

Haodi Ma , Daisy Zhe Wang

分类：自然语言处理 | 人工智能 | 机器学习

2023-01-03

Knowledge graphs (KG) have served as the key component of various natural language processing applications. Commonsense knowledge graphs (CKG) are a special type of KG, where entities and relations are composed of free-form text. However, previous works in KG completion and CKG completion suffer from long-tail relations and newly-added relations which do not have many know triples for training. In light of this, few-shot KG completion (FKGC), which requires the strengths of graph representation learning and few-shot learning, has been proposed to challenge the problem of limited annotated data. In this paper, we comprehensively survey previous attempts on such tasks in the form of a series of methods and applications. Specifically, we first introduce FKGC challenges, commonly used KGs, and CKGs. Then we systematically categorize and summarize existing works in terms of the type of KGs and the methods. Finally, we present applications of FKGC models on prediction tasks in different areas and share our thoughts on future research directions of FKGC.

translated by 谷歌翻译

Reference Twice: A Simple and Unified Baseline for Few-Shot Instance Segmentation

Yue Han , Jiangning Zhang , Zhucun Xue , Chao Xu , Xintian Shen , Yabiao Wang , Chengjie Wang , Yong Liu , Xiangtai Li

分类：计算机视觉

2023-01-03

Few Shot Instance Segmentation (FSIS) requires models to detect and segment novel classes with limited several support examples. In this work, we explore a simple yet unified solution for FSIS as well as its incremental variants, and introduce a new framework named Reference Twice (RefT) to fully explore the relationship between support/query features based on a Transformer-like framework. Our key insights are two folds: Firstly, with the aid of support masks, we can generate dynamic class centers more appropriately to re-weight query features. Secondly, we find that support object queries have already encoded key factors after base training. In this way, the query features can be enhanced twice from two aspects, i.e., feature-level and instance-level. In particular, we firstly design a mask-based dynamic weighting module to enhance support features and then propose to link object queries for better calibration via cross-attention. After the above steps, the novel classes can be improved significantly over our strong baseline. Additionally, our new framework can be easily extended to incremental FSIS with minor modification. When benchmarking results on the COCO dataset for FSIS, gFSIS, and iFSIS settings, our method achieves a competitive performance compared to existing approaches across different shots, e.g., we boost nAP by noticeable +8.2/+9.4 over the current state-of-the-art FSIS method for 10/30-shot. We further demonstrate the superiority of our approach on Few Shot Object Detection. Code and model will be available.

translated by 谷歌翻译

RELIANT: Fair Knowledge Distillation for Graph Neural Networks

Yushun Dong , Binchi Zhang , Yiling Yuan , Na Zou , Qi Wang , Jundong Li

分类：机器学习

2023-01-03

Graph Neural Networks (GNNs) have shown satisfying performance on various graph learning tasks. To achieve better fitting capability, most GNNs are with a large number of parameters, which makes these GNNs computationally expensive. Therefore, it is difficult to deploy them onto edge devices with scarce computational resources, e.g., mobile phones and wearable smart devices. Knowledge Distillation (KD) is a common solution to compress GNNs, where a light-weighted model (i.e., the student model) is encouraged to mimic the behavior of a computationally expensive GNN (i.e., the teacher GNN model). Nevertheless, most existing GNN-based KD methods lack fairness consideration. As a consequence, the student model usually inherits and even exaggerates the bias from the teacher GNN. To handle such a problem, we take initial steps towards fair knowledge distillation for GNNs. Specifically, we first formulate a novel problem of fair knowledge distillation for GNN-based teacher-student frameworks. Then we propose a principled framework named RELIANT to mitigate the bias exhibited by the student model. Notably, the design of RELIANT is decoupled from any specific teacher and student model structures, and thus can be easily adapted to various GNN-based KD frameworks. We perform extensive experiments on multiple real-world datasets, which corroborates that RELIANT achieves less biased GNN knowledge distillation while maintaining high prediction utility.

translated by 谷歌翻译